On the Use of Spatial Cues to Improve Binaural Source Separation
Authors
Abstract
Motivated by human hearing, we devise a computational model for localizing multiple sources in stereo signals and apply it to the separation of sound sources. The method employs spatial cues to resolve high-frequency phase ambiguities. More specifically, we use relationships between the short-time Fourier transforms (STFT) of the two signals to estimate the two most important spatial cues: the time differences (TD) and level differences (LD) between the sensors. Using models of both free-field wave propagation and head-related transfer functions (HRTF), these cues are combined to form estimates of spatial parameters such as the directions of arrival (DOA). The theory is validated by the experimental results presented in the paper.
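As an illustration of the cues named in the abstract, the following minimal sketch estimates per-bin TD and LD from the STFTs of a two-channel signal and maps a TD to a free-field DOA. It is not the authors' model: the function names (`stft`, `spatial_cues`, `doa_free_field`), the window and hop sizes, the sensor spacing of 0.18 m, and the speed of sound of 343 m/s are all assumptions made for the example, and no HRTF model is included. The TD is taken from the wrapped inter-channel phase, so it is only unambiguous where the true phase difference stays below π, which is the high-frequency ambiguity the abstract refers to.

```python
import numpy as np


def stft(x, n_fft=1024, hop=256):
    """Hann-windowed STFT; returns an array of shape (frames, n_fft // 2 + 1)."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack(
        [x[i * hop : i * hop + n_fft] * window for i in range(n_frames)]
    )
    return np.fft.rfft(frames, axis=1)


def spatial_cues(left, right, fs, n_fft=1024, hop=256, eps=1e-12):
    """Per time-frequency bin estimates of TD (seconds) and LD (dB).

    Sign convention (an assumption of this sketch): positive TD means the
    right channel lags the left; positive LD means the right is louder.
    """
    L = stft(left, n_fft, hop)
    R = stft(right, n_fft, hop)
    freqs = np.fft.rfftfreq(n_fft, d=1.0 / fs)

    # Level difference from the magnitude ratio of the two spectra.
    ld_db = 20.0 * np.log10((np.abs(R) + eps) / (np.abs(L) + eps))

    # Wrapped inter-channel phase. A pure delay tau on the right channel
    # gives a phase of -2*pi*f*tau, hence the minus sign below.
    phase = np.angle(R * np.conj(L))
    with np.errstate(divide="ignore", invalid="ignore"):
        td = -phase / (2.0 * np.pi * freqs)
    td[:, 0] = 0.0  # the DC bin carries no delay information

    return freqs, td, ld_db


def doa_free_field(td, sensor_distance=0.18, c=343.0):
    """Free-field far-field DOA in degrees from a time difference.

    sensor_distance (m) and c (m/s) are assumed values for illustration.
    """
    s = np.clip(c * np.asarray(td) / sensor_distance, -1.0, 1.0)
    return np.degrees(np.arcsin(s))
```

A typical use would be to call `spatial_cues(left, right, fs)`, pool the TD estimates over the low-frequency bins where the phase is unambiguous (for example with a median), and pass the pooled value to `doa_free_field`. How the LD is then used to pick the correct phase wrap at higher frequencies is specific to the paper and is not reproduced here.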
Similar resources
Effect of spatial separation on speech-in-noise comprehension in dyslexic adults
This study tested the use of binaural cues in adult dyslexic listeners during speech-in-noise comprehension. Participants listened to words presented in three different noise-types (Babble-, Fluctuating- and Stationary-noise) in three different listening configurations: dichotic, monaural and binaural. In controls, we obtained an important informational masking in the monaural configuration mostl...
Time-Frequency Masking for Blind Source Separation with Preserved Spatial Cues
In this paper, we address the problem of speech source separation by relying on time-frequency binary masks to segregate binaural mixtures. We describe an algorithm which can tackle reverberant mixtures and can extract the original sources while preserving their original spatial locations. The performance of the proposed algorithm is evaluated objectively and subjectively, by assessing the esti...
Monaural Source Separation Using Spectral Cues
The acoustic environment poses at least two important challenges. First, animals must localise sound sources using a variety of binaural and monaural cues; and second, they must separate sources into distinct auditory streams (the “cocktail party problem”). Binaural cues include interaural intensity and phase disparity. The primary monaural cue is the spectral filtering introduced by the head a...
Role of binaural hearing in speech intelligibility and spatial release from masking using vocoded speech.
A cochlear implant vocoder was used to evaluate the relative contributions of spectral and binaural temporal fine-structure cues to speech intelligibility. In Study I, stimuli were vocoded, and then convolved through head-related transfer functions (HRTFs) to remove speech temporal fine structure but preserve the binaural temporal fine-structure cues. In Study II, the order of processing was revers...
Contributions of binaural processing to segregating and selecting speech in a complex sound mixture
Intuitively, we all believe that binaural processing plays a critical role in communication, especially at the venerable “cocktail party.” Indeed, if you attend a poster session at a large conference (like the ICA), close your eyes, plug one ear, and try to follow a scientific discussion, you will experience the importance of having two ears. Here we will discuss how binaural processing contrib...
Journal title:
Volume, issue
Pages -
Publication date 2003